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Abstract — This paper derives an outer bound on the capacity 
region of a general memoryless interference channel with an 
arbitrary number of users. The derivation follows from a 
generalization of the techniques developed by Kramer and by 
Etkin et al for the Gaussian two-user channel. The derived bound 
is the first known outer bound valid for any memoryless channel. 

In Gaussian noise, classes of channels for which the proposed 
bound gives the sum-rate capacity are identified, including 
degraded channels and a class of Z-channels. 

Index Terms — Interference channel; Degraded channel; Outer 
bound; Sum-capacity; Z-channel; 

I. Introduction 

Determining the ultimate capacity limits of the general 
memoryless K-user InterFerence Channel (K-IFC) is an open 
problem since its inception. The network considered in this 
work is depicted in Fig. [T] it consists of K pairs of nodes 
and is defined by input alphabets (A'l, . . . , Xk), output al- 
phabets (3^1, . . . , and a channel transition probability 
Py-i,...,Yk\Xi,...,Xk- "^he only assumption on the channel is 
that it is memoryless. Source i G [1 : K], wishes to 
communicate to destination i din independent message Wi. A 
(e"^^^, . . . , e^^^, n, en) code consists of K encoding func- 
tions [1 : e^^^] Xl", K decoding functions Wi : J^f 
[1 : e^^^], such that Pr [^{¥1") ^ Wi] < e^, i G [1 : K]. 
A rate-tuplet {Ri^ . . . ^ Rk) is achievable if there exists a 



family of (e 



nRi 



e^^^ ^n^en) codes such that e^, ^ 



as n ^ oo. The capacity region is the convex closure of the 
set of achievable rate-tuplet . . . , Rr)- The capacity is not 
known in general. 

The goal of this paper is to derive an outer bound for the 
K-IFC that holds for any memoryless channel (not necessarily 
Gaussian) and for any K > 2. 

A. Past Work 

The capacity region of the 2-IFC is known if the interference 
is strong |1 1-|3 |, if the channel outputs are deterministic and 
invertible functions of the inputs |4), and if the channel has a 
special form of degradeness |j5|, |[6|. The largest known inner 
bound is due to Han and Kobayashi (HK) |7l and uses rate 
splitting and joint decoding. General outer bounds are due to 
Sato 1^, lH, and Carleial ^ (see also Kramer |[T0| Th.5]). 

For the Gaussian 2-IFC, the capacity region is fully known 
in strong interference only fSl, |TT|, p2|. T he sum-capacity 
is however known in mixed interference |13|, |T4| , for the Z- 
channel fTS] , and in very weak interference |T4| , |T6| , p7| . In 
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Fig. 1. The general memoryless InterFerence channel with K source- 
destination pairs (X-IFC) considered in this work. 



mixed and weak interference, a simple rate splitting in the HK 
region is optimal to within one bit |T8|. The best outer bound 
may be obtained by intersecting the regions derived by Kramer 
in [lOJ , by Etkin et al. in [ ,18J , and the region independently 
obtained in |[14|, |[T6|, (TTj and later tighten by Etkin in fT9| . 

Few results are available for more than two users and/or for 
non-Gaussian channels. Please refer to j20j, and references 
therein, for a more detailed discussion of the past work that 
we shall only briefly list in the following for sake of space. 

General inner bound regions are lacking. A straightforward 
generalization of the HK approach, whereby each user has a 
different message for every subset of non-intended receivers, 
has a super-exponential complexity in the number of users 
and might be suboptimal in general. In fact, coding schemes 
that deal directly with the effect of the aggregate interference, 
rather than with each interferer separately, as with interference 
alignment | [2T| and with structured codes p2| , are known to 
achieve a larger number of degrees of freedom than simple 
HK schemes for the Gaussian noise channel | [2T| , | |23| , | |24| . 
To the best of the author's knowledge, no outer bounds have 
been developed for the general (i.e., non-Gaussian) IFC with 
more than two users. 

In Gaussian noise, channels with a special structure have 
been investigated such as: the "fully symmetric" channel 
y/7J, |25[ , |[26|, the "cyclic symmetric" channel f2T|, 
the three-user channel with "cyclic mixed strong- very strong" 
interference |29|, and the "one-to-many" and the "many-to- 



16 
1 28 



one" channel [17j , [23l , pQ| . The Degrees of Freedom (DoF) 



of the Gaussian K-IFC has received more attention | |2T| , | [24| , 
|[3T|-p3|; the lesson from the DoF analysis is that structured 
codes appear to outperform purely random codes and that 
the high-SNR analysis is very sensitive to the way the K'^ 
parameters of the K-IFC are let grow to infinity. 

Of direct relevance for this work are the Gaussian 2- 



IFC outer bounds derived by Kramer in |[T0| Th.l] and by 
Etkin et al in fTS'l, which we seek to generalize to any 
memory less channel with any number of users. The basic 
idea is to give side information to the receiver(s) in such a 
way that the resulting bound can be single-letterized, does 
not involve auxiliary random variables and can be computed 
for channels of interest, such as the Gaussian channel. An 
extension of |10, Th.l, first proof] to the i^T-user Gaussian 
channel, with any K > 2, appeared in [26J ; the idea is to 
provide a group of receivers with sufficient side information 
so that they can decode a subset of the users as in a Multiple 
Access Channel (MAC) channel-as also discussed in |25|; the 
resulting optimization problem however does not appears to 
have a closed-form solution in general (a closed form result 
was given in |26J for degraded channels only) and iterative 
algorithms for its numerical evaluation are discussed in p4| . 
In this work, we will approach the problem from a different 
angle: we generalize [10 Th.l, second LMMSE-based proof] 
rather than Th.l, first "general optimization problem'' - 
based proof]. We will show that our bound can be evaluated 
in closed-form for certain Gaussian channels and it is sum- 
capacity for some classes of channels. Extensions of (TSj Th.l] 
to the i^T-user Gaussian channel appeared in pT] , pS] . In 
both works, the receivers are given a side information signal 
that generalizes that of |[T8[ Th.l] whereby entropy terms 
are related by using the entropy power (EPI) |36| and/or the 
extremal inequality (EI) |37 | rather than chosen so that they 
cancel one another. In this work we simply generalize the 
approach of [18, Th.l] to any memoryless channel as it is 
not obvious what EPI and/or EI are for a general channel. 

B. Contributions and Paper Organization 
The main contributions of this work are: 

1) In Section |Il| we derive an outer bound on the capacity 
region of the general memoryless IFC, i.e., not nec- 
essarily Gaussian, with an arbitrary number of source- 
destination pairs. 



2) In Section [III] we specialize the bound derived in Sec- 
tion |II| to the Gaussian channel. In |20|, we showed that 
there exist channel parameters for which our proposed 
bound is the tightest known for the sum-rate of the 
Gaussian 3 -IFC. Here, we derive the sum-capacity of 
certain Z-like Gaussian i^T-IFCs. We also discuss how 
to generalize this sum-capacity result to non-Z Gaussian 
channels; in particular we offer two alternative proofs 
for the sum-capacity of the Gaussian degraded channel 
originally derived in | [26| . 

Section |IV| concludes the paper. Some of the proofs are in the 
Appendix. 
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Fig. 2. Receiver side information for the proof of ThQ] Left: for the bound 
in {la) with *S = [1 : K] and tt = (1, . . . , K). Right: for the bound in {lb). 



II. Main Result 

Theorem 1. The capacity region of a general memoryless K- 
IFC is contained into: 



ok-ifc = u n 

P-Pq nf=i Px^lQ {S,ny.SC[l:K],nen[S] 



k=l 



l(Y^,;X{[nk : 7r|5|])|x([7ro : 7rfe_i]), F([7ro : 7rfc_i]), q) , 

(la) 

Y,Ru<^H{Yk\Y\^J-H{Y\„JYk,Xi,...,XK), \, 
ues k=i ^ 

(lb) 

where the union is over all input distributions P = 
Pxi,...,Xk,Q that factorize as Pq Y[k=i -^Xk\Q inter- 
section is over all subsets S of the user-index set [1 : K] and 
over all permutations tt of the elements of S. The random 
variable Y\j. has the same distribution of Yk\xk f^-^-' 
conditional distribution of an output given its intended input). 
A random variable with subscript ttq is a constant. 

Proof: The details of the proof may be found in the 
Appendix. The key idea is to provide the k-th receiver, 
k e [1 : K], with the side information Sk shown in Fig. [2] ■ 

We remark that: 

1) Th[T] holds for any memoryless IFC and for any number 
of users. 

2) Since the capacity region of a i^T-user IFC does 
not depend on the joint transition probability 
Pyi,...,Yk\Xi,...,Xk (because the receivers cannot 
cooperate), but only on the marginal transition 
probabilities PYk\Xi,...,XK^ k e [1 : K], each bound in 
Th. [T](one for each pair (5,7r)) can be optimized with 
respect to the joint probability Py^^...Xk\Xi,...,Xk 
long as the marginal probabilities are preserved. 

3) The bound in ([Ta| reduces to 1 10, Th.l] for the Gaussian 
2-IFC when A'g = (see ||l0feq.(34)] which inspired 
the side information structure given on the left side of 
Fig.©. 

4) The bound in ([lb]) reduces to jlS] Th.l] for the Gaussian 
2-IFC when A'3 = by setting 5'i = 1\2 and ^2 = 



The bound in ([Tb]) is tighter than |[T8] Th.l] because the 
correlation coefficient between the Gaussian noise of the 
channel output Yj. and the Gaussian noise of the side 
information 1^^^^, {k^iTk) G [1 : K]'^, can be optimized 
so as to get the tightest bound. It is however not tighter 
than the bound independently obtained in [ ,14J , [16J , [17J 
for the 2-user Gaussian channel. 

5) From ([Ta| we get N{K) = Ylk=i i^)^- I'ate bounds. 
For K = 2, the A^(2) = 4 bounds are as in |10, Th.l] 
(two single-rate bounds and two sum-rate bounds). For 
K >3, the N{K)-K'^ bounds that involve at least three 
rates cannot be simply derived by silencing all but two 
users and then by applying |[T0| Th.l] to the resulting 
2-IFC. The number of bounds grows exponentially with 
K: N{3) = 15, 7V(4) = 52, 7V(5) = 325, etc. 
Similarly, from ([Tb]) we get N{K) bounds; those that 
involve at least three rates cannot be obtained by simply 
applying the 2-IFC sum-rate bound in [18, Th.l]. 

6) Th[T]can be easily evaluated. For example, the "Gaussian 
maximizes entropy" f36l suffices to guarantee that a 
jointly Gaussian input is optimal for Gaussian channels. 

7) Th[T] can be extended to other memoryless channels 
without receiver cooperation. For example, the 2-user 
cognitive channel was considered in [ 38J , the 2-IFC with 
a cognitive relay in p9| , and the 2-IFC with generalized 
feedback (a.k.a. source cooperation) in |[4Q|-|[42|. 

For the i^T-IFC with generalized feedback for example, 
Th[T] must be modified as follows: (a) replace each 
channel output with the pair (yfc,lGF,fe), where 
^GF,/c is the channel output observed at transmitter 
k, k G [1 : K], (h) consider the union over all 
possible joint input distributions Pxi,...,Xk (because the 
generalized feedback enables source cooperation which 
results in correlated inputs); (c) choose the worst joint 
transition probability Pyi,...,y^|Xi,...,x^ 
that preserves the marginals Py^ \Xi,... ,Xk 
ke[l: K]. 

Similar extensions are possible for other channels. 

8) For the Gaussian 2-IFC, besides the bounds in [lOj 
and in |T8| that we generalized in Thj^the following 
outer bounds are known: fTO', Th.2] g^l, (TSj, (TTl 
and p9| . These bounds are tighter than |18 Th.l] for 
some weak interference parameters. It is left for future 
work to generalize these 2-user Gaussian channel bounds 
to non-Gaussian channels with more than two users. 
We note that the common feature of these bounds is 
to generalize the class of genie signals of |18, Th.l] by 
relating entropy terms rather than canceling them (see 
proof of Th[T]); this is done by using the entropy power 
(EPI) ||36| and/or the extremal inequality (EI) |37j |; the 
extension of the EPI and/or the EI to general (i.e., non- 
Gaussian) channels is not trivial. 

III. Gaussian channels 



,^GF,l 
rGF,l, 
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the sum-capacity for certain Z-channels (subsection III-B). 
We conclude with Subsection IIII-CI where we discuss how 
to extend the result of Subsection |III-B| to non-Z channels; 
in doing so we show that Th[T] eq. ([Ta| gives the sum-rate 
capacity of degraded channels, thereby providing an alternative 
proof for the result of [26]; we also offer an alternate proof 
for the sum-rate capacity of the degraded Gaussian K-IFC by 
generalizing an argument originally devised by Sato for the 
degraded Gaussian 2-IFC |8|. 

A. The Gaussian Channel Model 

A SISO (single input single output) complex-valued Gaus- 
sian i^T-IFC in standard form has outputs: 



Y,, 



K 

E 

k=i 



with input power constraint E[|X^p] < 1 and noise Zi ~ 
A/'(0,1), i G [1 : K]. The correlation among the Gaussian 
noises is irrelevant since the capacity only depends on the 
marginal noise distributions. The channel gains are fixed and 
are known to all terminals. Without loss of generality, the di- 
rect link gains hi^i, i G [1 : K], can be taken to be real- valued 
(because receiver i can compensate for the phase of hi^i) and 
strictly positive (if p = then the SNR at receiver i is zero 
even in absence of interference, which implies that Ri = 
is optimal, i.e., the system has effectively one less user). The 
Gaussian i^T-IFC is completely specified by the channel matrix 
H:[H],^j=h,^j,{iJ)e[l:K]x[l:K]. 

In the following we adopted the Matlab-like convention that 
H-jz^c in the \7l\ x \C\ matrix obtained from H by retaining 
the rows indexed by TZ and the columns indexed by C 

B. Sum-capacity of Z- like channels 

Here we consider a class of Gaussian i^-IFCs for which the 
channel matrix H is upper triangular. This class of channels 
can be thought of as the multi-user generalization of the 2- 
IFC Z-channel fTSj . The following theorem establishes the 
sum-capacity for a subset of Z-channels for which treating 
interference as noise is optimal: 

Theorem 2. Consider a K x K noise covariance matrix 
defined recursively as follows: let J2i = [1] and V/c = 
2,...,K let 

^k-l Pk-l\ ^ ^ ^k-l . _ H y ^ 

Pk-1 ^ J 

Consider a channel matrix H whose upper triangular part is 
defined recursively as follows: for k = . . . ,2 



Hi 



[kUk] 



H[l:k-l],[k] - II rr_ ^ l|2 

{Pk-1 + H[i,k-l],[k^l:K]H[k]^[k^l:K]) , (2) 



In this section we first introduce the Gaussian channel model 
(subsection III-A ). We then show that Th[T] eq. ([Ta| gives 



while the entries below the main diagonal of H are zero. 
For the channel defined by the sum-rate capacity is given 



by ([Ta| and equals: 

K 

Y,Rk< >:iog| 1 



K 

fe=i 



\h 



kM 



k=i k=i \ 1 + i^fc^^p y 

Proof: Since every mutual information term in ([Ta| 
contains all the inputs, the "Gaussian maximizes entropy" 
principle |36| assures that iid A/^(0, 1) inputs are optimal. 
Consider S = [1 : K] with tt = (1, . . . , in ([Ta| and rewrite 
the sum-rate as: 



(3) 



k=i [k=i J 



(4) 



hi^k) = i{Yu . . . , life-i, . . . , Xk-i). (5) 

The channel matrix H defined by ^ is such that for each 

that is, conditioned on (Xi, . . . , X/e_i) the set of outputs 
(Yi, . . . , y/e_i) is a degraded version of Yj. and thus: 




rk' 



(6) 



By summing the rates in ([6]) over all /c G [1 : i^T] we obtain 
the sum-rate upper bound in ([3]). The upper bound in ([Sj can 
be achieved by simply treating interference as noise at each 
receiver (recall that for the Z-channel, the k-th receiver is 
interfered by (X^+i, . . . , Xk) only). ■ 
By considering all possible covariance matrices T,k, Th[2] 
identifies a novel class of channels for which treating interfer- 
ence as noise is sum-rate optimal (besides those in (17] Th.4, 
Th.5, and Th.7] and (SSj Th.3]) as shown in the following 
examples. The correspondence between channel matrices and 
noise covariance matrices given by ^ is interesting in itself 
and deserves further analysis. 

Example 1. Th|2] assures that treating interference as noise 
is optimal for all channels that can be built as in ^ from a 
covariance matrix of the type: 



/I 



V2 



Vk 



i-k-1 



K 



J 



K 

E 

k=2 



Vk? < 1. 



The resulting channel has gains hi^k = Vkhk^k^ k = 2^ 
and zero for the remaining non-diagonal entries; this channel 
is to the so-called many-to-one channel |23|. The condition 



2^k=2 



Vk\ < 1 identifies a subset of many-to-one channels 



for which treating interference as noise is optimal. The con- 
dition ^f_2 l^/cP < 1 is equivalent to ^ Th.4], thus Th^ 
generalizes (TTj Th.4]. 

The relationship between the class of channels identified by 
Th|2]and that identified by (35] Th.3] (of which |17j Th.4 and 



Th.5] are special cases) is subject of current investigation. We 
note that 1 35 , Th.3] is obtained from a generalization of (TSj 
Th.l] while Th|2]from a generalization of 1 10, Th.l], it is thus 
possible that |35, Th.3] and Th|2]do not imply one another. 

Example 2. Consider channels that can be obtained as in ^ 
from a rank-one covariance matrix of the type: 



K\i,j 



: ai/aj 



for some G C^. The resulting channel has 

entries hij = hj^ai/aj, j G : K] and i e [1 : K], and zero 
for the remaining non-diagonal entries. For these channels the 
sum-capacity is: 



K 



\cik? \hk,k/cik\' 



(7) 



In the next subsection we relate the channels considered in 
Example 2 with the class of degraded channels studied in [261 

C. Sum- capacity of non-Z channels 

The condition expressed by ([2]) is only sufficient for the 
achievability of ([3]). In general, the expression in ^ is an 
upper bound to the sum-capacity of channels for which the 
upper triangular part of H can be expressed as in ([2]) and the 
entries below the main diagonal have any arbitrary value. For 
such channels, the entries below the main diagonal might need 
to satisfy some extra constraints (besides ([2])) in order for ([3]) 
to be achievable. The following discusses an example of such 
extra constraints. 

The expression in ^ suggests the following achievable 
strategy for non-Z channels: the k-\h receiver first decodes 
users 1, . . . , /c — 1, then strip them from its received signal, 
and finally decodes its intended message by treating the signal 
of users /c + 1, . . . , K as noise; with this "successive decoding 
strategy" the rate-tuplet (ri, . . . , r^) in ([6]) is achievable if: 

Theorem 3. The sum-rate in ^ is achievable for a channel 
H such that the upper triangular part of H can be expressed 
as in ^ and such that the entries below the main diagonal 
satisfy: 

ke[l:K] 5fcC[l:fe-l] 

for {n,..., Vk) defined in 



Proof: The proof follows from the previous discussion. 
The achievable region in Th|3]is the intersection of K MAC 
regions where only the constraints that involve the intended 
rate have been retained. ■ 

As a corollary to Th[3] we have: 

Corollary 4. Degraded channels, that is, channels for which 
H has rank one, satisfy the assumptions of TTzjj] 



Proof: Consider a K-WC with unit rank channel matrix 
H = ab^ , for some i^T-length column vectors a and b (notice 
that these channels have upper triangular part as in Example 2 
with bl = hk^k/cik)- Without loss of generality, assume that 
the entries of the vector a satisfy |ai| < |a2|... < |ax|. With 
this ordering the channel outputs from a Markov chain: 



X 

k=l 

dkXeci + ^/c, k e[l : K]. 



For this channel, clearly the k-th decoder can decode all users 
with index i < k without imposing any rate penalty to these 
users; thus sum-rate in ^ is achievable. ■ 
Corollary]?] offers a simple proof for the sum-capacity result 
of [261 ; this implies that Th|3] generalizes the result of |26|. 

Another proof for the converse part of Corollary |4] can be 
obtained by generalizing the bound for the degraded Gaussian 
2-IFC proposed by Sato in We have: 

Theorem 5. The capacity of the Gaussian K-IFC with chan- 
nel matrix H = ab^ , such that \ai\ < |a2|... < |ax|, is outer 
bounded by: 



Rk < log 1 



Mhf\ak\ 



(8) 



jj^k+l ^3 



for all > 0, /c G [1 : K], such that Xl/c=i = 1- 

Proof: By letting the transmitters cooperate, the capacity 
of the unit-rank K-\¥C is outer bounded by the capacity of 
a i^T-user degraded SISO Broadcast Channel (BC) with input 
^eq = Ef=i K^k, input power constraint E[Xeqp] < \\bf, 
and outputs Yk = cikX^^ -\- Zk, k e [I : K]. The capacity of 
this degraded K-user BC is given by ^ |36|. ■ 
By using the outer bound in Th|5]we have the following very 
simple proof for the converse part of Corollary |4j by letting 
/^nll^lP = l^/cP in ([8]) we immediately obtain the upper bound 
in ([3]), which is equivalent to ^ with = hk^k/dk- 

IV. Conclusions 

In this work we developed a framework to derive an outer 
bound for the general memory less interference channel with an 
arbitrary number of source-destination pairs. For the Gaussian 
channel, we showed that the proposed bound gives the sum- 
capacity for certain channels, including some Z-channels and 
degraded channels. 

Appendix 

Proof of ([Ta|. Consider a non-empty subset S of [1 : K] 
and let be its complement in [1 : i^]. Consider without loss 
of generality the permutation tt = (1,2,.. .J^l) (the others 
are obtained by relabeling the users). We use the following 



conventions: for a set 5, W{S) = {Wi : i G S}. We have: 

\s\ \s\ 
n Y.{Rk-en)^^Y.^{X]:',^^) 

fc=l k=l 



(b) 



\S\ 



S 2^ ^ [y^k ^^k ^ J^i 5 • • • 5 ^k-i^ -^1 5 • • • , ^k-i^ ^ )) 

k=l 

— 2^j-\^k 5 5 -^1 5 • • • 7 -^fc-il^i 5 • • • ^^k-1^ ^ \^ )) 

k=l 
\^\ k 

k=l £^1 
\S\ \S\ 



(e) 



£ , . . . ,^k-l) 



£=1 k=£ 



(f) V- 



\S\ 



(g) 



\s\ 



< n^/(X,,...,X|5|;r,|Xi,ri,...,X,_i,r,_i,X(5^),Q) 

where the different (in)equalities follow from: (a) Fano's 
inequality, (b) non-negativity of mutual information (i.e., add 
side information at the receivers as in Fig. [2]), (c) independence 
of messages (and thus of codewords), (d) chain rule for mutual 
information, (e) swap order of summation, (f) chain rule 
for mutual information, (g) "conditioning reduces entropy", 
memoryless property of the channel, and by introducing a 
"time sharing" random variable Q uniformly distributed on 
[1 : n] and independent of everything else. 

Proof of ([lb]). Consider a non-empty subset 5 of [1 : i^T], 

Y,n{Ru-e^)<Y,I{X^;Yn 

kes kes 

<Y,I{Xt,Y^,Sl) 

kes 

= H{S]:) + HiY^Sl) - HiY^Xl) - H{S^\X^, Y^) 



kes 



(a: 



< ^i7(lT|5,-)-i7(5,-|r,^x^...,x^), 



kes 



where the inequality in (a) requires: 

J2{H{S^)-H{Y,"\X^)-I{S^;X^,...,Xi\X]:,Y,-)) < 0. 



kes 



A good candidate for the side information is as in Fig. [2] 
inspired by (TSj 



{Sk, keS} = k e 5}, V5 c [1 : K], 
where we defined Y\j^ ^Yk\xk- 
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